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We present a model for evolving population which maintains genetic polymorphism. By introduc- 
ing random mutation in the model population at a constant rate, we observe that the population 
does not become extinct but survives, keeping diversity in the gene pool under abrupt environ- 
mental changes. The model provides reasonable estimates for the proportions of polymorphic and 
heterozygous loci and for the mutation rate, as observed in nature. 

PACS numbers: 87.10.+e, 02.70.Lq 
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In the biological evolution process, it is known that 
the population remains polymorphic, consisting of two 
or more genotypes: Genetic variation thus persists KB. 
Balanced polymorphism stands for that the population 
consists of two or more genotypes with the rate of the 
most frequent allele less than 95%. The proportion 
of polymorphic loci, measured by electrophoresis at al- 
lozyme loci in animal and plants, takes the values in 
the range between 0.145 and 0.587 H[§. Provided that 
one allele replicates faster than others, it increases in fre- 
quency at the expense of the others. It may completely 
replace other alleles, and make the population monomor- 
phic, consisting of one genotype only. Often, however, 
the allele does not completely replace others but remain 
at a stable intermediate frequency, leaving the population 
polymorphic, although in nature the superficial similar- 
ity conceals the diversity of genotypes occurring among 
individuals within species. This is of importance in the 
light of the fact that the superiority of an allele holds 
only in certain environments, or at certain gene frequen- 
cies, or in conjunction with certain other alleles. In bi- 
ology, how to maintain polymorphism is described in a 
phcnomcnological manner by introducing such assump- 
tions as the balance of selection and recurrent mutation, 
selection balanced by gene flow, heterozygous advantage 
selection, frequency-dependent selection, and variable se- 
lection in time or space M. 

Recently, a number of models for evolving population, 
including geographic speciation and conditions for adap- 
tation of population, have been developed through the 
method of statistical mechanics |]||. In particular, Mroz 
et al. have introduced an interesting population model in 
which individuals are represented by their genotypes and 
phenotypes while the environment characterizes the ideal 
phenotype || . In this model, a random population must 
have the initial adaptation greater than a certain critical 
value in order to grow in an environment which does not 
change with time. In this paper, we attempt to describe 
genetic polymorphism, especially under the environmen- 
tal change, by means of a population model which ex- 
hibits genetic polymorphism within itself. It is shown 
that the population which displays genetically polymor- 
phic behavior is indeed advantageous for survival in the 



changing environment. 

We consider a population consisting of M individuals, 
each characterized by its sequence of genes with the size 
of the sequence given by N. Each gene is assumed to take 
two possible forms, i.e., there are two alleles denoted by 
A and a. Following Ref. ||, we consider diploidal or- 
ganisms, in which the genotype at locus a (= 1, 2, N) 
in the sequence can be A A (represented by G a = 1), 
Aa = aA (G Q = 0), or aa (G a = -1). The genotype 
of the ith individual (i — 1,2, ...,M) is then given by 
the sequence {Gf } = {G- , Gf , Gf }. We assume that 
the phenotype Fj of the ith individual simply follows its 
genotype with allele A dominant: If Gf — 1 or 0, then 
Ff = 1; if Gf = -1, then Ff = 0. The environment 
is characterized by a certain phenotype F, which is also 
represented by a sequence of N numbers, each of which 
being either or 1. We denote by m the relative number 
of l's in the ideal phenotype F. The survival probability 
Pi for the ith individual is assumed to be determined by 
the similarity of its phenotype to the ideal one F: 
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where S(F", F a ) is the Kronecker delta symbol. The av- 
erage adaptation of the population to the environment is 
defined to be 



A^M-iJ2l 



(2) 



We begin with M — Mq individuals with randomly dis- 
tributed genotypes, and generalize the adaptation model 
in Ref. M as follows: The population is not allowed to 
grow indefinitely; the maximum size of the population is 
set to be L. We first choose randomly a pair of individ- 
uals i and j, and calculate survival probabilities of these 
two individuals from Eq. (1). They may die at this stage 
with probability 1—pi and 1— pj, respectively, decreasing 
the value of M. In case that both the two survive, we 
calculate the probability for the population size 



P p = l- M/L, 



(3) 
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according to which the individual i and its mating part- 
ner j become parents. Thus the probability P p provides 
the limit of the population size, which is in contrast with 
the existing adaptation model Iq] . The parents produce 7 
offsprings and then die. The genotypes of the offsprings 
are determined from those of parents, following the stan- 
dard genetic rule. 

Before considering the effects of environmental changes 
and mutations based on this model, we first examine the 
general behavior of the model. With the initial popula- 
tion which has randomly distributed genotypes, we ex- 
amine the relation in the stationary state between to, the 
density of l's in F, and C = M/L, the concentration of 
the surviving population. As the size N of the sequence 
increases, the concentration C as a function of to is found 
to approach a step-like structure, which is similar to the 
one found in the existing adaptation model fl. 

It is further observed that m c , which denotes the the 
value of to corresponding to the concentration of the sur- 
viving population C = 0.5, depends not only on the size 
iV but also on the number of offsprings: For given N, m c 
gets smaller/larger as the number 7 of offsprings is in- 
creased/decreased. For example, in case N=15, we have 
m c = 1 for 7 < 2, and m c = for 7 > 6. As the size of 
the sequence grows, the minimum number of offsprings 
required to keep the population from extinction, i.e., to 
have m c < 1, tends to increase. Namely, for a given 
number of offsprings, the population eventually becomes 
extinct as N is increased; this is found to be indepen- 
dent of the initial concentration of the population. This 
phenomenon is similar to the error threshold in the qua- 
sispecies model 0. For given N and C, the minimum 
number of offsprings is also found to decrease as the max- 
imum size L of the population grows. This implies that 
the population with a larger value of L can survive bet- 
ter than that with a smaller value, which reflects that 
large Mq leads to less dispersion in the distribution of 
the average adaptation. 

Effects of environmental changes: It is in general ex- 
pected that the biological environment does not remain 
unchanged for a long time, making it desirable to intro- 
duce change of the environment. For this purpose, we 
choose randomly n loci among the total N loci of F at 
every I0 6 generations, and change their values. Namely, 
if the selected ideal phenotype is I, it is changed to 0, and 
vice versa. Figure 1(a) shows that the total population 
becomes extinct after three times of the environmental 
change, while the average adaptation vanishes as shown 
in Fig. 1(b). Such behavior is closely linked to genetic 
polymorphism. To examine this, we define the propor- 
tion of polymorphic loci as follows: 
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where 



a=l 



1 if (i/M)\Y:Zi G i\ < - 9 ; 

otherwise. 



(4) 



Similarly, the proportion of heterozygous loci, which have 
genotype Aa, is defined by 



M N 

iW ^-££[l-(G?) 2 ]. 



(•5) 



The proportion of polymorphic loci in natural popula- 
tion is known to take the values between 0.145 and 0.587 
while that of heterozygous loci lies in the range between 
0.037 and 0.170 §[§. Figure 1(c) exhibits that P po i y does 
not keep its value but decreases when the environmen- 
tal change takes place. This implies that the popula- 
tion adapted to the environment initially becomes more 
monomorphic as the environment changes. Figure 1(d) 
also shows that the proportion of heterozygous loci goes 
below the value in natural population as the environment 
changes. 

This monomorphic tendency can be understood in 
terms of the Hardy- Weinberg theorem (!]], which is ap- 
plicable in the absence of correlation between the loci in 
a single gene sequence. In this case, each locus in the 
sequence may be considered independent. It is further 
assumed that the population is effectively infinite in size 
and that all individuals in the population mate at ran- 
dom. Suppose that the gene frequency of allele A is p and 
that of allele a is q (= 1 — p). Since mating is random 
within the population, the frequencies of the genotypes 
AA, Aa, and aa will be p 2 , 2pq, and q 2 , respectively; this 
leads to the frequency of A among these progeny given 
by p 1 = p 2 + (2pq)/2 = p, which is the same as the al- 
lele frequency in the previous generation. On the other 
hand, when one takes into account the different fitnesses 
relative to the environment, the allele frequency in gen- 
eral changes with the generation. Recall that genotypes 
AA and Aa have the phenotype F = 1, with the fitness 
given by to, the proportion of l's in the ideal phenotype, 
whereas the fitness of genotype aa is given by 1 — to. 
This leads, after one generation, to the gene frequency 
for allele a: 



mpq 



(1 - m)q 2 



mp 2 + 2mpq + (1 — m)q 2 ' 



(G) 



and the change in the allele frequency per generation is 
given by 



Aq = q' - q = - 



(2 — to r )pq 2 
1 - (2 - m-^q 2 ' 



(7) 



which is less than zero for to > 1/2. The stationary state 
condition Aq = then yields the stationary frequency 
q = 0; this implies that allele a is replaced by A which is 
more adaptive to the environment. 

To prevent such monomorphic behavior, the system 
needs mutation. Suppose that allele A mutates to a and 
vice versa, both with the mutation rate u. The change in 
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the allele frequency of a, due to mutation, per generation 
is then given by 

Aq mut = u(l - q) - uq = tt(l - 2q), (8) 

which leads to the total change in the allele frequency of 
a per generation 

(2 — 77l~ j??C7 

Aq tot = Aq + Aq mut = -t^-tz + "(1 - 2g). 

1 — (2 — vn'~ l )q i 

(9) 

Note that Aq to t can be either positive or negative and 
that Aqtot = in equilibrium gives 

(2 - m- x )[(l + 2«)4, - (1 + - 2ug e9 + « = 0, 

(10) 

the solution q eq of which is displayed in Fig. 2 as a func- 
tion of u, for various values of m. It shows that the 
equilibrium value q eq increases from zero, regardless of 
m, as the mutation rate u grows. In this way, polymor- 
phism is easily achieved in the Hardy- Weinberg theorem 
if the effect of mutation is explicitly taken into account. 

Roles of mutation: We therefore consider mutation also 
in our model, which keeps the population from going 
monomorphic, and hence prevents total extinction of the 
population. Mutation is known to occur at random in 
the sense that the change of specific mutation is not af- 
fected by how useful that mutation would be in the given 
environment ||. We thus add the mutation mechanism 
to the generation of population as follows: Every m g gen- 
erations, we select one individual randomly, choose one 
allele in a randomly selected locus in the individual, and 
alter the allele. Namely, if the selected allele is A, it is 
replaced by a, and vice versa. The mutation rate per gen- 
eration per individual per locus is given by (m g M N) . 

We have first examined the relation between m and C 
in the presence of mutation (m g ^ 0), and found that the 
random mutation process does not alter qualitatively the 
behavior of our model. We have further investigated the 
effects of mutation on such quantities as the concentra- 
tion of the population, the average adaptation, the pro- 
portion of polymorphic loci, and the proportion of het- 
erozygous loci. We have thus measured them at time 10 7 , 
after ten environmental changes, each of which consists of 
changing n—A loci among the total iV=15 loci of the ideal 
phenotype. Figure 3 displays the behavior of those quan- 
tities with the mutation interval m g . Below the mutation 
interval m g — 3 (not seen in the scale of Fig. 3), the pop- 
ulation cannot survive the environmental change. Thus 
too frequent mutations also prevent the population from 
adapting to the environment. As the mutation interval 
is increased, both the concentration of the population 
and the average adaptation becomes larger, approaching 
rapidly the value unity, similarly to the case of the adap- 
tation model without mutation. However, there exists a 
critical value of the mutation interval, beyond which the 



adaptation drops to zero and the population becomes ex- 
tinct completely In Fig. 3(c) and (d), the proportions of 
polymorphic loci and of heterozygous loci are shown to 
have the values around 0.4 and 0.1, respectively, in the 
appropriate range of the mutation interval. These values 
of Ppoiy and Phetero, in the range of the mutation inter- 
val for survival, are consistent with the genetic variations 
observed in natural population. 

We have also considered dependence on the values of n, 
the number of ideal phenotype loci changed when envi- 
ronment changes. Figure 4 shows Am g , the range of the 
mutation interval for survival of population, as a func- 
tion of n. As the value of n grows, Am g is shown to 
decrease exponentially. When n is larger than 6, which 
corresponds to the environmental change greater than 
40%, the population cannot adapt to the changing en- 
vironment and becomes extinct in spite of mutation. It 
is known from the study of individual proteins that mu- 
tation at individual loci in general arises at the rate of 
10~ 6 to 10~ 5 per generation ||. The proper mutation 
rate (m g MN)~ l , measured in our model for n = 5, lies 
in the range between 8 x 10~ 8 and 4 x 10~ 6 , which is 
consistent with the data in natural population. 

In conclusion, we have presented a model for evolving 
population. It has been revealed that mutation in the 
population provides a variety in the gene pool and leads 
to genetic polymorphism. Such polymorphism, which 
appears commonly in natural population, plays a key 
role for survival of the population in the changing en- 
vironment. It is thus concluded that through mutation, 
the population can adapt to the changing environment, 
maintaining genetic polymorphism. The predicted pro- 
portions of polymorphic and heterozygous loci as well as 
the mutation rate are consistent with the data observed 
in nature. The genetic polymorphism has also been dis- 
cussed from the viewpoint of the Hardy- Weinberg theo- 
rem. 
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FIG. 1. Behavior of (a) the concentration of surviving pop- 
ulation, (b) the average adaptation, (c) the proportion of 
polymorphic loci, and (d) the proportion of heterozygous 
loci, as functions of time t in units of the generation, un- 
der the environmental change. In each change, n—4 loci have 
been changed among the total iV=15 loci. The initial values 
M = 1000 and Co = 0.2 have been chosen. The system is 
shown to become monomorphic, leading to complete extinc- 
tion. Lines are guides to the eye. 



FIG. 2. The equilibrium frequency q eq of the recessive 
allele as a function of the mutation rate u per generation, 
for various values of m. Regardless of m, the equilibrium 
frequency q eq increases with u, preventing monomorphic be- 
havior. 

FIG. 3. Effects of mutation in the same system as in Fig. 
1: (a) the concentration of surviving population, (b) the av- 
erage adaptation, (c) the proportion of polymorphic loci, and 
(d) the proportion of heterozygous loci as functions of the 
mutation interval m g in units of the generation, measured at 
time t = 10 7 generations after ten environmental changes. In 
the appropriate range of the mutation rate, the population 
survives the environmental change. Lines are guides to the 
eye. 

FIG. 4. The range of the mutation interval for survival of 
population, Am g , as a function of n. While the solid line is a 
guide to the eye, the dashed line represents the least-square 
fit with the slope — 1.81± 0.18. It is shown that Am g tends 
to shrink exponentially with n and vanishes at n = 6. 
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Fig. 2 
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